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VANM2 13.00 1 AUS PATENT 
IDENTIFICATION OF BIOLOGICAL (MICRO) ORGANISMS BY DETECTION OF 

\ THEIR HOMOLOGOUS NUCLEOTIDE SEQUENCES ON ARRAYS 

5 Field of the inventxon 

The present invention is in the JA&ld of 
^ diagnosis and is related to a method anj^^TtT comprising 

^ ^ X reaqents and means for the ident i£ieart!Ton (detection and/or 

T\ .. , > quantification) of (micro) organisms among other ones having 
10 homologous nucleotide^sequences by identification of their 
nucleotj^^^sequences, after amplification by a single primer 

The invention is especially suited for the 
identification and/or quantification of (micro) organisms of 
15 the same genus or family or for the detection and/or 
quantification of related genes in a specific (micro) organism 
present in a biological sample. 



Background of the invention 

20 The development of the biochips technology 

allows the detection of multiple nucleotide sequences 
simultaneously in a given assay and thus allow the 
identification of the corresponding organism or part of the 
organism. Arrays are solid supports containing on their 

25 surface a series of discrete regions bearing capture 
nucleotide sequences (or probes) that are able to bind (by 
hybridisation) to a corresponding target nucleotide 
sequence (s) possibly present in a sample to be analysed. If 
the target sequence is labelled with modified nucleotides 

30 during a reverse transcription or an amplification of said 
sequence, then a signal can be detected and measured at the 
binding location. Its intensity gives an estimation of the 
amount of target sequences present in the sample. Such 
technology allows the identification and/or quantification of 

35 genes or species for diagnostic or screening purpose. 
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The Company Affymetrix Inc. has developed a 
method for direct synthesis of oligonucleotides upon a solid 
support, at specific locations by using masks at each step of 
5 the processing. Said method comprises the addition of a new 
nucleotide on a growing oligonucleotide in order to obtain a 
desired sequence at a desired location. This method is 
derived from the photolithographic technology and is coupled 
with the use of photoprotect ive groups, which are released 

10 before a new nucleotide is added (EP-A1 - 0476014 , 
US-A-5,445, 934, US-A-5 , 143 , 854 and US-5 , 510 , 270) . However, 
only small oligonucleotides are present on the surface, and 
said method finds applications mainly for sequencing or 
identifying a pattern of positive spots corresponding to each 

15 specific oligonucleotide bound on the array. The 
characterization of a target sequence is obtained by 
comparison of such pattern with a reference. Said technique 
was applied to the identification of Mycobacterium 
tuberculosis rpoB gene (W097/29212 and W098/28444) , wherein 

20 the capture nucleotide sequence comprises less than 30 
nucleotides and from the analysis of two different sequences 
that may differ by a single nucleotide (the identification of 
SNPs or genotyping) . Small capture nucleotide sequences 
(having a length comprised between 10 and 20 nucleotides) are 

25 preferred since the discrimination between two 
oligonucleotides differing in one base is higher, when their 
length is smaller. 

The lack of sensitivity of the method is 
illustrated by the fact that it cannot detect directly 

30 amplicons resulting from genetic amplification (PCR) . A 
double amplification with primer (s) bearing a T3 or T7 
sequences and then a retrotranscript ion with a RNA 
polymerase. These RNA are cut into pieces of about 40 bases 
before being detected on an array (example 1 of WO 97/29212) . 

35 However, long DNA or RNA fragments hybridize very slowly on 
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capture probes present on a surface. Said methods are 
therefore not suited for the detection of homologous 
sequences since the homology varies along the sequences and 
so part of the pieces could hybridize on the same capture 
5 probes. Therefore, a software for the interpretation of the 
results should be incorporated in the method for allowing 
interpretation of the obtained d^ta. 

However, for gene expression array which is 
based on the cDNA copy of mRNA the same problem is 

10 encountered when using small capture probe arrays: the rate 
of hybridisation is low. Therefore, the fragments are cut 
into smaller species and the method requires the use of 
several capture nucleotide sequences in order to obtain a 
pattern of signals which attest the presence of a given gene 

15 (WO97/10364 and W097/27317) . Said cutting also decreases the 
number of labelled nucleotides, and thus reduces the obtained 
signal. In this case, the use of long capture nucleotide 
sequences give a much better sensitivity to the detection. In 
the many gene expression applications, the use of long 

20 capture probes is not a problem, when cDNA to be detected 
originates from genes having different sequences, since there 
is no cross-reactions between them. Long capture nucleotide 
sequences give the required sensitivity, however, they will 
hybridize to other homologous sequences. 

25 Using membranes or nylon supports are proposed 

to increase the sensitivity of the detection on solid support 
by incorporation of a spacer between the support and the 
capture nucleotide sequences. Van Ness et al . (Nucleic Acids 
Research, Vol.19, p. 3345, 1991) describe a 

30 poly (ethyleneimine) arm for the binding of DNA on nylon 
membranes. The European patent application EP-0511559 
describes a hexaethylene glycol dervivative as spacer for the 
binding of small oligonucleotides upon a membrane. When 
membranes like nylon are used as support, there is no control 

35 of the site of binding between the solid support and the 
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oligonucleotides and it was observed that a poly dT tail 
increased the fixation yield and so the resulting 
hybridization (W089/11548) . Similar results are obtained with 
repeated capture sequences present in a polymer 
5 (US 5, 683, 872) . 

Guo et al . (Nucleic Acids Research 22, 5456, 
1994) teach the use of poly dT of 15 bases as spacer for the 
binding of oligonucleotides on glass with increased 
sensitivity of hybridization. 

10 The document WO99/16780 describes the detection 

of 4 homologous sequences of the gene femA on nylon strips. 
However, no data on the sensitivity of the method and the 
detection is presented. In said document, the capture 
nucleotide sequences comprise between 15 and 350 bases with 

15 homology less than 50% with a consensus sequence. 

The publication of Anthony et al. (Journal of 
clinical microbiology, Vol.38 nr. 2, p. 7817-8820) describes 
the use of a membrane array for the discrimination with low 
sensitivity of homologous sequences originated from a several 

20 related organisms. Targets to detect are rDNA amplified from 
bacteria by consensus PCR and the detection is obtained on 
nylon array containing capture nucleotide sequences for said 
bacteria and having the capture nucleotide sequences having 
between 20 and 30 bases which are covalently linked to the 

25 nylon, and there is no control of the portion of the sequence 
which is available for hybridization. 

lHm« nf thm invontioa .Su^ft^ 3 • -M^Vi )( 

The present invention aims to provide a new 
30 method and device to improve microarrays or biochips 
technology for the easy identification (detection and/or 
quantification) of a large number of (micro) organisms or 
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portions of (micro) organisms having homologous nucleotide 
sequences . 

A further aim of the invention is to provide 
such method and device which are based upon a simplified 
technology requiring the use single primer (s) in an 
amplification step and which allow the identification 
(detection and/or quantification) of a specific target 
sequence by the identification and/or recording of a single 
spot signal upon said microarray, said signal resulting only 
from the specific binding of th£ target sequence with its 
corresponding capture sequence. 
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Definitions 

The terms "nucleic acid, oligonucleotide, 
array, probe, target nucleic acid, bind substantially, 
hybridising specifically to, background, quantifying" are the 
ones described in the international patent application 
W097/27317 incorporated herein by reference. 

The terms "nucleotide triphosphate, nucleotide, 
primer sequence" are those described in the document 
WO00/72018 and PCT/BE00/00123 incorporated herein by 
references . 

The terms "Homologous sequences" and "consensus 
sequence" are described in the European patent application 
00870055.1 incorporated herein by reference. 



■ fi ttmrnagy of tho inv<?nfeion » 

The inventors have .discovered-' * that it 
possible to drastically simplify the J^ietitif ication of one or 

30 several (micro) organisms amor^^many other ones having 
homologous sequences b^., -combining a single amplification 
using common prim&r-" "pair and an identification of the 
possible {mpefo) organism (s) by detecting and possibly 
recording^'upon an array the presence of a single signal 

35 re>«^ting only from a binding between a capture sequence and 
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its corresponding target sequence and correlating J^lae 
presence of said detected target sequence txS^ the 
identification of a genetic sequence specific of said 
X (micro) organism (s) . This means that tha method and device 

\¥Z 5 according to the invention will allow the easy 

y* identification/detection of a specific sequence among other 

homologous sequences and possibly its quantification 
(characterisation of ..-€&e number of copies or presence of said 
organisms in au-biological sample) of a target sequence, said 
10 target sequence having a nucleotide sequence specific of said 
Jjiticro) organisms . 

Such identification may be obtained directly, 
after washing of possible contaminants (unbound sequences), 
by detecting and possibly recording a single spot signal at 
15 one specific location, wherein said capture nucleotide 
sequence was previously bound and said identification is not 
a result of an analysis of a specific pattern upon the 
microarray as proposed in the system of the state of the art . 
Therefore, said method and device do not necessarily need a 
20 detailed analysis of said pattern by an image processing and 
a software analysis. 

This invention was made possible by discovering 
that target sequences can be discriminated from other 
homologous ones upon an array with high sensitivity by using 
25 bound capture nucleotide sequences composed of at least two 
parts, one being a spacer bound by a single and 
advantageously predetermined (defined) link to the support 
(preferably a non porous support) and the other part being a 
specific nucleotide sequence able to hybridise with the 
30 nucleotide target sequence. 

Furthermore, said detection is greatly 
increased, if high concentrations of capture nucleotide 
sequences are bound to the surface of the solid support. 

The present invention is related to the 
35 identification of a target sequence obtained from a 
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biological (micro) organism or a portion thereof, especially a 
gene possibly present in a biological sample from at least 4 
other homologous (micro) organisms or a portion thereof, said 
other (micro) organisms could be present in the same 
5 biological sample and have homologous nucleotide sequences 
with the target. 

Said identification is obtained firstly by a 
genetic amplification of said nucleotide sequences (target 
and homologous sequences) by common primer pairs followed 

10 (after washing) by a discrimination between the possible 
different target amplified. Said discrimination is 
advantageously obtained by hybridization upon the surface of 
an array containing capture nucleotide sequences at a given 
location, specific for a target specific for each 

15 (micro) organism to be possibly present in the biological 
sample and by the identification of said specific target 
through the identification and possibly the recording of a 
signal resulting from the specific binding of this target 
upon its corresponding capture sequence at the expected 

20 location (single location signal being specific for the 
target) . 

According to the invention, the preferred 
method for genetic amplification is the PCR using two anti- 
parallel consensus primers which can recognise all said 
25 target homologous nucleotide sequences. 

The method according to the invention further 
comprises the step of correlating the signal of detection 
(possibly recorded) to the presence of : 

• specific (micro) organism (s) , 

30 • genetic characteristics of a sequence, 

• polymorphism of a sequence, 

• diagnostic predisposition or evolution (monitoring) of 
genetic diseases, including cancer of a patient 
(including the human) from which the biological sample 

35 has been obtained. 
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Therefore, said (micro) organisms could be 
present in any biological material including genetic material 
obtained (virus, fungi, bacteria, plant or animal cell, 
including the human) . The biological sample can be also any 
5 culture medium wherein microorganisms, xenobiotics or 
pollutants are present, as well as such extract obtained from 
a plant or an animal (including a human) organ, tissue, cell 
or biological fluid (blood, serum, urine, etc) . 

The method according to the invention can be 

10 performed by using a specific identification (diagnostic 
and/or quantification) kit or device comprising at least an 
insoluble solid support upon which are bound single stranded 
capture nucleotide sequences (preferably bound to the surface 
of the solid support by a direct covalent link or by the 

15 intermediate of a spacer) according to an array with a 
density of at least 4, preferably at least 10, 16, 20, 50, 
100, 1000, 4000, 10 000 or more, different single stranded 

capture nucleotide sequences/cm 2 insoluble solid support 
surface, said single stranded capture nucleotide sequences 

20 having advantageously a length comprised between about 3 0 and 
about 600 bases (including the spacer) and containing a 
sequence of about 10 to about 60 bases, said sequence being 
specific for the target (which means that said bases of said 
sequence are able to form a binding with their complementary 

25 bases upon the sequence of the target by complementary 
hybridisation) . Preferably, said hybridisation is obtained 
under stringent conditions (under conditions well-known to 
the person skilled in the art) . 

In the method and kit or device according to 

30 the invention, the capture nucleotide sequence is a sequence 
having between 16 and 600 bases, preferably between 30 and 
300 bases, more preferably between 40 and 150 bases and the 
spacer is a chemical chain of at least 6,8 nm long (of at 
least 4 carbon chains) , a nucleotide sequence of more than 30 

35 bases or is nucleotide derivative such as PMA. 
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The method, kit and device according to the 
invention are particularly suitable for the identification of 
a target, being preferably biological (micro) organisms or a 
part of it, possibly present in a biological sample where at 
5 least 4, 12, 15 or even more homologous sequences are 
present. Because of the high homology, said sequence can be 
amplified by common primer (s) so that the identification of 
the target is obtained specifically by the discrimination 
following its binding with the corresponding capture 

10 nucleotide sequence, previously bound at a given location 
upon the microarray. The sensitivity can be also greater 
increased if capture nucleotide sequences are spotted to the 
solid support surface by a robot at high density according to 
an array. A preferred embodiment of the invention is to use 

15 an amount of capture nucleotide sequences spotted on the 
array resulting in the binding of between about 0.01 to about 
5 pmoles of sequence equivalent /cm 2 of solid support surface. 

The kit or device according to the invention 
may also incorporate various media or devices for performing 

20 the method according to the invention. Said kit (or device) 
can also be included in an automatic apparatus such as a high 
throughput screening apparatus for the detection and/or the 
quantification of multiple nucleotide sequences present in a 
biological sample to be analysed. Said kit or apparatus can 

25 be adapted for performing all the steps or only several 
specific steps of the method according to the invention. 

In the method, the kit (device) or apparatus 
according to the invention, the length of the bound capture 
nucleotide sequences is preferably comprised between about 3 0 

30 and about 600 bases, preferably between about 40 and about 
400 bases and more preferably between about 40 and about 100 
bases. Longer nucleotide sequences can be used if they do not 
lower the binding yield of the target nucleotide sequences 
usually by adopting hairpin based secondary structure or by 

35 interaction with each other. 
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If the homology between the sequences to be 
detected is low (between 30 and 60%) , parts of the sequence 
which are specific in each sequence can be used for the 
design of specific capture nucleotide sequences binding each 
5 of the different target sequences. However, it is more 
difficult to find part of the sequence sufficiently conserved 
as to design "consensus" sequences which will amplify or copy 
all desired sequences. If one pair of consensus primers is 
not enough to amplify all the homologous sequences, then a 
10 mixture of two or more primers pairs is added in order to 
obtain the desired amplifications. The minimum homologous 
sequences amplified by the same consensus primer is two, nut 
there is no limitation to said number. 

If the sequences show high degree of homology, 
15 higher than 60% and even higher than 90%, then the finding of 
common sequence for consensus primer is easily obtained, but 
the choice for specific capture nucleotide sequences become 
more difficult. 

In another preferred embodiment of the 
20 invention, the capture nucleotide sequences are chemically 
synthesised oligonucleotides sequences shorter than 100 bases 
(easily performed on programmed automatic synthesiser) . Such 
sequences can bear a f unct ionalised group for covalent 
attachment upon the support, at high concentrations. 
25 Longer capture nucleotide sequences are 

preferably synthesised by PCR amplification (of a sequence 
incorporated into a plasmid containing the specific part of 
the capture nucleotide sequence and the non specific part 
(spacer) ) . 

30 In a further embodiment of the invention, the 

specific sequence of the capture nucleotide sequence is 
separated from the surface of the solid support by at least 
about 6.8 nm long, equivalent to the distance of at least 20 
base pair long nucleotides in double helix form. 
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In the method, kit (device) or apparatus 
according to the invention, the portion(s) (or part(ies)) of 
the capture nucleotide sequences complementary to the target 
is comprised between about 10 and about 60 bases, preferably 
5 between about 15 and about 40 bases and more preferably 
between about 2 0 and about 3 0 bases. These bases are 
preferably assigned as a continuous sequence located at or 
near the extremity of the capture nucleotide sequence. This 
sequence is considered as the specific sequence for the 

10 detection. In a preferred form of the invention, the sequence 
located between the specific capture nucleotide sequence and 
the support is a non specific sequence. 

In another embodiment of the invention, a 
specific nucleotide sequence comprising between about 10 and 

15 about 60 bases, preferably between about 15 and about 40 
bases and more preferably between about 2 0 and about 3 0 bases 
is located on a capture nucleotide sequence comprising a 
sequence between about 30 and about 600 bases. 

The method, kit (device) or apparatus according 

20 to the invention are suitable for the detection and/or the 
quantification of a target which is made of DNA or RNA, 
including sequences which are partially or totally homologous 
upon their total length. 

The method according to the invention can be 

25 performed even when a target present between an homology (or 
sequence identity) greater than 30%, greater than 60% and 
even greater than 80% and other molecules. 

In the method, kit (device) or apparatus 
according to the invention, the capture nucleotide sequences 

30 are advantageously covalently bound (or fixed) upon the 
insoluble solid support, preferably by one of their 
extremities as described hereafter. 

The method according to the invention gives 
significant results which allows identification (detection 

35 and quantification) with amplicons in solutions at 
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concentration of lower than about 10 nM, of lower than about 
1 nM, preferably of lower than about 0.1 nM and more 
preferably of lower than about 0.01 nM (= 1 fmole/100 /il). 

Another important aspect of this invention is 
5 to use very concentrate capture nucleotide sequences on the 
surface. If too low, the yield of the binding is quickly 
lower and is undetectable. Concentrations of capture 
nucleotide sequences between about 600 and about 3,000 nM in 
the spotting solutions are preferred. However, concentrations 

10 as low as about 100 nM still give positive results in 
favourable cases (when the yield of covalent fixation is high 
or when the target to be detected is single stranded and 
present in high concentrations) . Such low spotting 
concentrations would give density of capture nucleotide 

15 sequence as low as 20 fmoles per cm 2 . On the other side, 
higher density was only limited in the assays by the 
concentrations of the capture solutions, but concentrations 
still higher than 3,000 nM give good results. 

The use of these very high concentrations and 

20 long probes are two unexpected characteristic features of the 
invention. The theory of DNA hybridisation proposed that the 
rate of hybridisation between two DNA complementary sequences 
in solution is proportional to the square root of the DNA 
length, the smaller one being the limited factor (Wetmur, 

25 J.G. and Davidson, N. 1968, J. Mol . Biol. 3, 584). In order 
to obtain the required specificity, the specific sequences of 
the capture nucleotide sequences had to be small compared to 
the target. Moreover, the targets were obtained after PCR 
amplification and were double stranded so that they 

30 reassociate in solution much faster than to hybridise on 
small sequences fixed on a solid support where diffusion is 
low thus reducing even more the rate of reaction. It was 
unexpected to observe a so large increase in the yield of 
hybridisation with the same short specific sequence. 
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The amount of a target which "binds" on the 
spots is very small compared to the amount of capture 
nucleotide sequences present. So there is a large excess of 
capture nucleotide sequence and there was no reason to obtain 
5 the binding if even more capture nucleotide sequences. 

One may perform the detection on the full 
length sequence after amplification or copy and when 
labelling is performed by incorporation of labelled 
nucleotides, more markers are present on the hybridised 

10 target making the assay sensitive. 

The method, kit and apparatus according to the 
invention may comprise the use of other bound capture 
nucleotide sequences, which may have the same characteristics 
as the previous ones and may be used to identifying a target 

15 from another group of homologous sequences (preferably 
amplified by common primer (s) ) . 

In the microbiological field, one may use 
consensus primer (s) specific for each family, or genus, of 
microorganisms and then identify some or all the species of 

20 these various family in an array by using capture nucleotide 
sequences of the invention. Detection of other sequences can 
be advantageously performed on the same array (i.e. by 
allowing an hybridisation with a standard nucleotide sequence 
used for the quantification, with consensus capture 

25 nucleotide sequences for the same or different micro- 
organisms strains, with a sequence allowing a detection of a 
possible antibiotic resistance gene by micro-organisms or for 
positive or negative control of hybridisation) . Said other 
capture nucleotide sequences have (possibly) a specific 

30 sequence longer than 10 to 60 bases and a total length as 
high as 600 bases and are also bound upon the insoluble solid 
support (preferably in the array made with the other bound 
capture nucleotide sequences related to the invention) . A 
long capture nucleotide sequence may also be present on the 

35 array as consensus capture nucleotide sequence for 
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hybridisation with all sequences of the microorganisms from 
the same family or genus, thus giving the information on the 
presence or not of a microorganism of such family, genus in 
the biological sample. 
5 The same array can also bear capture nucleotide 

sequences specific for a bacterial group (Gram positive or 
Gram negative strains or even all the bacteria) . 

Another application is the detection of 
homologous genes from a consensus protein of the same 

10 species, such as various cytochromes P450 by specific capture 
nucleotide sequences with or without the presence of a 
consensus capture nucleotide sequence for all the cytochromes 
possibly present in a biological sample. Such detection is 
performed at the gene level by retrotranscription into cDNA. 

15 The solid support according to the invention 

can be or can be made with materials selected from the group 
consisting of gel layers, glasses, electronic devices, 
silicon or plastic support, polymers, compact discs, metallic 
supports or a mixture thereof (see EP 0 535 242, 

20 US 5,736,257, W099/35499, US 5,552,270, etc). Advantageously, 
said solid support is a single glass slide which may comprise 
additional means (barcodes, markers, etc.) or media for 
improving the method according to the invention. 

The amplification step used in the method 

25 according to the invention is advantageously obtained by well 
known amplification protocols, preferably selected from the 
group consisting of PCR, RT-PCR, LCR, CPT, NASBA, ICR or 
Avalanche DNA techniques. 

Advantageously, the target to be identified is 

30 labelled previously to its hybridisation with the single 
stranded capture nucleotide sequences. Said labelling (with 
known techniques from the person skilled in the art) is 
preferably also obtained upon the amplified sequence 
previously to the denaturation (if the method includes an 

35 amplification step) . 
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Advantageously, the length of the target is 
selected as being of a limited length preferably between 100 
and 200 bases, preferably between 100 and 400 bases and more 
preferably between 100 and 800 bases. This preferred 
5 requirement depends on the possibility to find consensus 
primers to amplify the required sequences possibly present in 
the sample. Too long target may reallocate faster and adopt 
secondary structures which can inhibit the fixation on the 
capture nucleotide sequences. 
10 Detection of genes is also a preferred 

application of this invention. The detection of homologous 
genes is obtained by first retrotranscription of the mRNA and 
then amplification by consensus primers as described in this 
invention . 

15 According to a further aspect of the present 

invention, the method, kit (device) or apparatus according to 
the invention is advantageously used for the identification 
of different Staphylococcus species or variant, preferably 
the S. aureus, the S. epidermidis, the S. saprophytics, the 

20 S. hominis or the S. haemolyticus for homologous organs 
present together or separately in the biological sample, said 
identification being obtained by detecting the genetic 
variants of the FemA gene in said different species, 
preferably by using a common locations in the FemA genetic 

25 sequence. 

Preferably, the primer (s) and the specific 
portions of said FemA sequence used for obtaining amplified 
products are the ones described hereafter in example 2. These 
primers have been selected as consensus primers for the 
30 amplification of the FemA genes of all of the 
16 Staphylococcus tested and they probably will amplify the 
FemA from all other possible Staphylococcus species. 

The detection of the 12 MAGE according to the 
invention is presented in figure 4 an 5 and in example 4. The 
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array allows to read the MAGE number by observation of the 
lines positive for signal bearing the specific capture 
probes ( example 5). 

The same application was developed for the 
5 Receptors Coupled to the G Proteins (RCGP) . These receptors 
bind all sort of ligands and are responsible for the signal 
transduction to the cytoplasm and very often to the nucleus 
by modulating the activity of the transcriptional factors. 
Consensus primers are formed for the various subtypes of RCGP 
10 for dopamine and for serotonine and histamine (figures 6 and 
7 and example 4) . The same is possible for the histamine and 
other ligands. 

The detection of the various HLA types is also 
one of the applications of the invention (figure 7 and 

15 example. HLA are homologous sequences which differ from one 
individual to the other. The determination of the HLA type is 
especially useful in tissue transplantation in order to 
determine the degree of compatibility between the donor and 
the recipient. It is also a useful parameter for 

20 immunisation. Given the large number of subtypes and the 
close relation between the homologous sequences it was not 
always possible to perfectly discriminate one sequence among 
all the other ones and for some of them there was one or two 
cross-reactions. In these cases, another capture probe was 

25 added on the array which gives a reaction with the sequence 
to be detected and another cross-reaction, in order to make 
the identification absolute. 

There are several forms of Cytochrome P4 5 0 
which are also homologous sequences. Figure 9 presents the 

30 design of the array to identify several cytochromes P450 
after retrotranscription and amplification with consensus 
primers (example 6) . 

The detection of polymorphism sequences (which 
can be considered as homologous even if differing by only one 

35 base) can be made also by the method according to the 
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invention. This is especially useful for the Cytochrome P450 
since the presence of certain isoforms modifies the 
metabolism of some drugs. 

Another aspect of the present invention is 
5 related to any part of biochips or microarray cpmprising said 
above described sequences (especially the -Specific capture 
nucleotide sequence described in the .examples) as well as a 
general screening method for tfrer identification of a target 
sequence specific of >a-±ti" microorganisms of family type 

10 descriminated ^ ^fxerrT "homologous sequences upon any type of 
mi c roar rays or biochips by any method. 

After hybridisation on the array, the target 
sequences can be detected by current techniques. Without 
labelling, preferred methods are the identification of the 

15 target by mass spectrometry now adapted to the arrays 
(US-A-5, 821, 060) or by intercalating agents followed by 
fluorescent detection (W097/27329 or Fodor et al . , Nature 364, 
p. 555 (1993) ) . 

The labelled associated detections are 

20 numerous. A review of the different labelling molecules is 
given in WO 97/27317. They are obtained using either already 
labelled primer or by incorporation of labelled nucleotides 
during the copy or amplification step. A labelling can also 
be obtained by ligating a detectable moiety onto the RNA or 

25 DNA to be tested (a labelled oligonucleotide, which is 
ligated, at the end of the sequence by a ligase) . Fragments 
of RNA or DNA can also incorporate labelled nucleotides at 
their 5' OH or 3' OH ends using a kinase, a transferase or a 
similar enzyme. 

30 The most frequently used labels are 

f luorochromes like Cy3 , Cy5 and Cy7 suitable for analysing an 
array by using commercially available array scanners (General 
Scanning, Genetic Microsystem, . . .) . Radioactive labelling, 
cold labelling or indirect labelling with small molecules 

35 recognised thereafter by specific ligands (streptavidin or 
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antibodies) are common methods. The resulting signal of 
target fixation on the array is either fluorescent, 
colorimetric , diffusion, electroluminescent, bio- or 
chemiluminescent , magnetic, electric like impedometric or 
5 voltametric (US-A-5 , 312 , 527) . A preferred method is based 
upon the use of the gold labelling of the bound target in 
order to obtain a precipitate or silver staining which is 
then easily detected and quantified by a scanner. 

Quantification has to take into account not 

10 only the hybridisation yield and detection scale on the array 
(which is identical for target and reference sequences) but 
also the extraction, the amplification (or copying) and the 
labelling steps . 

The method according to the invention may also 

15 comprise means for obtaining a quantification of target 
nucleotide sequences by using a standard nucleotide sequence 
(external or internal standard) added at known concentration. 
A capture nucleotide sequence is also present on the array so 
as to fix the standard in the same conditions as said target 

20 (possibly after amplification or copying) ; the method 
comprising the step of quantification of a signal resulting 
from the formation of a double stranded nucleotide sequence 
formed by complementary base pairing between the capture 
nucleotide sequences and the standard and the step of a 

25 correlation analysis of signal resulting from the formation 
of said double stranded nucleotide sequence with the signal 
resulting from the double stranded nucleotide sequence formed 
by complementary base pairing between capture nucleotide 
sequence (s) and the target in order to quantify the presence 

30 of the original nucleotide sequence to be detected and/or 
quantified in the biological sample. 

Advantageously the standard is added in the 
initial biological sample or after the extraction step and is 
amplified or copied with the same primers and/or has a length 

35 and a GC content identical or differing from no more than 20% 
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to the target. More preferably, the standard can be designed 
as a competitive internal standard having the characteristics 
of the internal standard found in the document W098/11253. 
Said internal standard has a part of its sequence common to 
5 the target and a specific part which is different. It also 
has at or near its two ends sequences which are complementary 
of the two primers used for amplification or copy of the 
target and similar GC content (W098/11253) . In the preferred 
embodiment of this invention, the common part of the standard 
10 and the target, means a nucleotide sequence which is 
homologous to all target amplified by the same primers (i.e. 
which belong to the same family or organisms to be 
quantified) . 

Preferably, the hybridisation yield of the 

15 standard through this specific sequence is identical or 
differ no more than 20% from the hybridisation yield of the 
target sequence and quantification is obtained as described 
in WO 98/11253. 

Said standard nucleotide sequence, external 

20 and/or internal standard, is also advantageously included in 
the kit (device) or apparatus according to the invention, 
possibly with all the media and means necessary for 
performing the different steps according to the invention 
(hybridisation and culture media, polymerase and other 

25 enzymes, standard sequence(s), labelling molecule(s), etc.). 

Advantageously, the biochips also contain spots 
with various concentrations (i.e. 4) of labelled capture 
nucleotide sequences. These labelled capture nucleotide 
sequences are spotted from known concentrations solutions and 

30 their signals allow the conversion of the results of 
hybridisation into absolute amounts. They also allow to test 
for the reproducibility of the detection. 

The solid support (biochip) can be inserted in 
a support connected to another chamber and automatic machine 

35 through the control of liquid solution based upon the use of 
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microfluidic technology. By being inserted into such a 
microlaboratory system, it can be incubated, heated, washed 
and labelled by automates, even for previous steps (like 
extraction of DNA, amplification by PCR) or the following 
5 step (labelling and detection) . All these steps can be 
performed upon the same solid support. 

The present invention will be described in 
details in the following non-limiting examples in reference 
to the enclosed figures. 

10 

Brief description of the drawings 

Figure 1 is a schematic presentation of the 
step used in the method of the invention for the 
identification of 5 Staphylococcus species on biochips after 

15 PCR amplification with consensus primers. 

Figure 2 represents the design of an array 
which allows the determination of the 5 most common 
Staphylococcus species, of the presence of any Staphylococcus 

strain and of the MecA gene. 
20 Figure 3 presents the effect of the length of 

the specific sequence of a capture nucleotide sequence on the 
discrimination between sequences with different level of 
homology. 

Figure 4 shows the sensitivity obtained for the 
25 detection of FemA sequences from S. aureus on array bearing 
the small specific capture nucleotide sequence for a S. 
aureus and a consensus sequence. 

Example 1: Detection of homologous FemA sequences on array 
30 bearing long specific capture nucleotide sequences (Fig. 3) 

Production of the capture nucleotide sequences and of the 
targets 
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The FemA genes corresponding to the different 
Staphylococci species were amplified separately by PCR using 
the following primers: 



10 
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s. 


aureus 1 : 


5 1 


s. 


aureus 2 : 


5 ' 


s. 


epi dermi dis 1 : 


5 ' 


s. 


epidermidis 2 : 


5 1 


s. 


haemolyticus 1 : 


5 1 


s. 


haemolyticus 2 : 


5 ' 


s. 


saprophyticus 1 : 


5 ' 


s. 


saprophyticus 2 : 


5 ' 


s. 


hominis 1 : 


5 ' 


s. 


hominis 2 : 


5 ' 



CTTTTGCTGATCGTGATGACAAA 3 1 
TTTATTTAAAATATCACGCTCTTCG 3 » 
TCGCGGTCCAGTAATAGATTATA' 3 ' 
TGCATTTCCAGTTATTTCTC^C 3 1 
ATTGATCATGGTATTGATAGATAC 3 ' 
TTTAATCTTTTTGAGTQTCTTATAC 3 ' 
TAAAATGAAACAACJCGGTTATAAG 3 1 
AAAC TAT C C AT A^ CAT T AAGT AC G 3 ' 
CGACCAGATA^lCAAAAAAGCACAA 3 1 
GTAATTCG^TACCATGTTCTAA 3 ' 



The PCR was performed in a final volume of 50 
fil containing: 1.5 mM MgCl 2 , /io mM Tris pH 8.4, 50 mM KC1 , 

0.8 (JVI of each primer, 50 tfk of each dNTP , 50 fiM of biotin- 
16-dUTP) , 1.5 U of Taq DJ^A polymerase Biotools, 7.5% DMSO, 5 
20 ng of plasmid containing FemA gene. Samples were first 

denatured at 94 °C fpr 3 min. Then 40 cycles of amplification 

/ 

were performed consisting of 30 sec at 94 °C, 30 sec at 60 °C 
and 30 sec at 12 °C and a final extension step of 10 min at 
72 °C. Water Controls were used as negative controls of the 
25 amplif icati^rh. The sizes of the amplicons obtained using 

these prittfers were 108 bp for S. saprophyticus, 139 bp for S. 

/ 

aureus bp for S. hominis, 101 pb for S. epidermidis and 

12 8 ^p for S. haemolyticus . The sequences of the capture 

/ 

nucleotide sequences were the same as the corresponding 
Amplicons but they were single strands. 
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The biochips also contains positive controls 
which were CMV ampl icons hybridised on their corresponding 
capture nucleotide sequence and negative controls which were 
capture nucleotide sequences for a HIV- I sequence on which 
5 the CMV could not bind. 

Capture nucleotide sequence immobilisation 

The protocol described by Schena et al (Proc. 
Natl Acad. Sci. USA 93, 10614 (1996)) was followed for the 

10 grafting of aminated DNA to aldehyde derivatised glass. The 
aminated capture nucleotide sequences were spotted from 
solutions at concentrations ranging from 150 to 3000 nM. The 
capture nucleotide sequences were printed onto the silylated 
microscopic slides with a home made robotic device (250 /urn 

15 pins from Genetix (UK) and silylated (aldehyde) microscope 
slides from Cell associates (Houston, USA)). The spots have 
400 fim in diameter and the volume dispensed is about 0,5 nl . 
Slides were dried at room temperature and stored at 4 °C 
until used. 

20 

Hybri di sat ion 

At 65 fil of hybridisation solution (AAT, Namur, 
Belgium) were added 5 pi of amplicons and the solution was 
loaded on the array framed by an hybridisation chamber. For 

25 positive controls we added 2 nM biotinylated CMV amplicons of 
437 bp to the solution; their corresponding capture 
nucleotide sequences were spotted on the array. The chamber 
was closed with a covership and slides were denatured at 95 
°C for 5 min. The hybridisation was carried out at 60° for 2 

30 h. Samples were washed 4 times with a washing buffer. 

Colorimetric detection 

The glass samples were incubated 45 min at room 
temperature with 800 /xl of streptavidin labelled with 
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colloidal gold 1000 x diluted in blocking buffer (Maleic 
buffer 100 mM pH 7.5, NaCl 150 mM, Gloria milk powder 0.1%). 
After 5 washes with washing buffer, the presence of gold 
served for catalysis of silver reduction using a staining 
5 revelation solution (AAT , Namur, Belgium) . The slides were 
incubated 3 times 10 min with 800 /il of revelation mixture, 
then rinsed with water, dried and analysed using a microarray 
reader. Each slides were then quantified by a specific 
quantification software. 

10 

Fluorescence detection 

The glass samples were incubated 45 min at room 
temperature with 800 ptl of Cyanin 3 or Cyanin 5 labelled 
streptavidin. After washing the slides were dried before 

15 being stored at room temperature. The detection was performed 
in the array-scanner GSM 418 (Genetic Microsystem, Woburn, 
MA, USA) Each slide was then quantified by a specific 
quantification software . 

The results give a cross-reaction between the 

20 species. For example, epidermidis amplicons hybridised on its 

capture probe give a value of 152, but give a value of 144, 
9, 13 and 20 respectively for the S. saprophyticus , 

S. aureus, S. haemolyticus and S. hominis capture probes. 

25 Example 2: Detection of homologous FemA sequences on array 

bearing small specific capture nucleotide sequences 

Protocols for capture nucleotide sequences 
immobilisation and silver staining detection were described 
in example 1 but the capture nucleotide sequences specific of 
30 the 5 Staphylococcus species were spotted at concentrations 

of 600 nM and are the following : 



23 




Name 


Sequence (5 ' - > 3 ' ) 


Capture 
nucleotide 
sequence 




ATaur02 

ATepi02 

AThae02 

AThomi>2^ 

ATSap02 


ATTTAA^ATATCACGCTCTTCGTTTAG 
AJTAAGCACATTTCTTTCATTATTTAG 
ATTTAAAGTTTCACGTTCATTTTGTAA 
ATTTAATGTCTGACGTTCTGCATGAAG 
ACTTAATACTTCGCGTTCAGCCTTTAA 



In this case, the targets are fragments of the 
FemA gene sequence corresponding to the different 
5 Staphylococci species which were amplified by a PCR using the 
following consensus primers : " 
APstap03: 5' CCCACTCGCJTTATATAGAATTTGA 3» 
APstap04: 5 ' .CCACfAGCGTACATCAATTTTGA 3' 
APst^pO^ ! 5 ' GGTTTAATAAAGTCACCAACATATT 3 ' 
10 ^ This PCR was performed in a final volume of 100 

111 containing: 3 mM MgCl2, 1 mM Tris pH 8, 1 fiM of each 

primer, 2 00 |iM of dACTP, dCTP and dGTP, 15 0 |iM of dTTP, 5 0 /jlM 
of biotin-16-dUTP, 2,5 U of Taq DNA polymerase (Boehringer 
Mannheim, Allemagne) 1 U of Uracil-DNA-glycosylase heat 
15 labile (Boehringer Mannheim, Allemagne) , 1 ng of plasmid 
containing FemA gene. Samples were first denatured at 94 °C 

for 5 min. Then 40 cycles of amplification were performed 
consisting of 1 min at 94°C, 1 min at 50°C and 1 min at 72°C 
and a final extension step of 10 min at 72 °C. Water controls 
20 were used as negative controls of the amplification. The 
sizes of the amplicons obtained using these primers were 489 
bp for all species. Figure 4 shows only the results obtained 

with the amplicons for S. epidermidis and S. xylosus. 
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The hybridisation solution was prepared as in 
example 1 and loaded on the slides. Slides were denatured at 
98°C for 5 min. Hybridisation are carried out at 50°C for 2h. 
Samples are then washed 4 times with a washing buffer. The 
5 values were very low and almost undetectable. 



Example 3: Effect of the spacer length on the Sensitivity of 
detection of homologous FemA sequences on array bearing long- 
capture nucleotide sequences with a small specific sequence 

10 The experiment was conducted as described in 

example 2 with the same ampl icons but the capture nucleotide 
sequences used are the following: 



. . *. \ 



Name 



Capture 

nucleotide 

sequence 

Ataur02 

ATepi02 
ATepi03 



ATepi04 



ATepiOS 



Sequence (5 ' - > 3 ' ) 



ATTTAAAATATCACGCTCTTCGTTTAG 
ATTAAGCACATTTCTTTCATTATTTAG, 

y 

GAATTCAAAGTTGCTGAGAA^ X 
ATTAAGCACATTTCTTTCATTATTTAG 

GAATTCA?(AGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 
ATTAAGCACATTTCTTTCATTATTTAG 

GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCGTCT 

TCTTAAAATCTAAAGAA 

ATTAAGCACATTTCTTTCATTATTTAG 



a The spacer sequences are underlined 
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The target amplicons were 489 bp long while the 
capture nucleotide sequences were 47, 67 or 87 bases single 
stranded DNA with a specific sequence of 27 bases. 

5 Example 4: Specificity of the detection of FemA sequences 
from different bacterial species on the same array bearing 
long capture nucleotide sequences with a small specific 
sequence 

The experiment was conducted as described in 
10 example 2 but the capture nucleotide sequences were spotted 
at concentrations of 3000 nM and are the following: 



A \ 



Name 


Sequence (5 ' -> 3 ' ) 


Capture 
nucleotide 




sequence 




Ataur27 


GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAdGAAGCG 
ATTTAAAATATCACGCTCTTCGTTTAG y " 


Atepi27 


y 

GAATTCAAAGTTGCTGAGAATAGT'f CAATGGAAGGAAGCG 

attaagcacatttctttcattXtttag 

/ 


Athae27 


gaattcaaagttgctCIagaatagttcaatggaaggaagcg 
atttaaagtttqAcgttcattttgtaa 


Athom2 7 


gaattg?(aagttgctgagaatagttcaatggaaggaagcg 
atttAatgtctgacgttctgcatgaag 


At sap 2 7 


gaattcaaagttgctgagaatagttcaatggaaggaagcg 
acttaatacttcgcgttcagcctttaa 



a Tfie spacer sequence is underlined. The specific sequences 
^ere of 27 bases 
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The targets are fragments of the FemA gene 
sequence corresponding to the different Staphylococci species 
which were amplified by PCR using the following consensus 
primers : 

APcons3-l: 5' TAAYAAARTCACCAACATAYTC 3' 
APcons3-2: 5' TYMGNTCATTTATGGAAGATAC 3' 
^ ~"' A consensus sequence is present on the biochips 

which detects all the tested Staphylococcus species. All 

10 target sequences were amplified by PCR with the same pair of 
primers . 

The size of the amplicons obtained using these 
primers were 587 bp for all species. The consensus sequence 
capture probe was a 489 base long single stranded DNA 
15 complementary to the amplicons of S. hominis as amplified in 

example 2. The detection was made in fluorescence. Homology 
between the consensus capture probe and the sequences of the 
femA from the 15 S. species were between 66 and 85%. All the 
sequences hybridized on this consensus capture probe. 

20 

Example 5: effect of the length of the svecific sequence of 
the capture nucleotide sequence on the discrimination between 
homologous sequences (figure 3). 

The experiment was conducted as described in 
25 example 4 but at a temperature of 43 °C and the capture 
nucleotide sequences used are presented in the table here 
joined. The numbers after the names indicate the length of 
the specific sequences. 

The FemA amplicons of S. anaerohius (a 

30 subspecies of S. aureus) were hybridised on an array bearing 
capture nucleotide sequences of 67 single stranded bases with 
. either 15, 27 and 40 bases specific for the S. aureus, 

anaerobius and epidermidis at their extremities. The 
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difference between the capture nucleotide sequences of 
anaerobius and aureus was only one base in the 15 base 
capture nucleotide sequence and 2 in the 27 and the 40 bases. 



Sequence (5 ' - > 3 ' ) 



/ 



Capture 

nucleotide 

sequence 

Ataurl5 



Ataur2 7 

Ataur4 0 

Atanal5 

Atana2 7 

Atana4 0 
Atepil5 
Atepi27 



At^epi4 0 



GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGC^CTT 



CTTAAAATGCTCTTCGTTTAGTT 



GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 



ATTTAAAATATCGCTCTTCGTTTAG 



GAATTCAAAGTTGCTGAGAATAGTTCA XATCTTTATTTAAAATA 



TCACGCTCTTCGTTTAGTTCTTT / 



GAATTCAAAGTTGCTGAGAZ^AGTTCAATGGAAGGAAGCGTCTT 



CTTAAAAT GCTCTTCATffTAGTT 



/ 



GAATTCAAAGTTjgCTGAGAATAGTTCAATGGAAGGAAGCG GTTT 



AAAATATCAGGCTCTTCATTTAG 

/ 

/ 

GAATT^AAAGTTGCTGAGAATAGTTCAA ATCTTTGTTTAAAATA 



TC^CGCTCTTCATTTAGTTCTTT 

gAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCGTCTT 



CTTAAAAT TTTCATTATTTAGTT 

GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 
ATTAAGCACATTTCTTTCATTATTTAG 



GAATTCAAAGTTGCTGAGAATAGTTCA AATCTTTATTAAGCACA 
TTTCTTTCATTATTTAGTTCCTC 
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Example 6: Sensitivity of the detection of FemA sequences/of 
Staphylococcus aureus on arrays bearing specific sequence as 
proposed by this invention and the consensus sequence (figure 
41 

5 The experiment was conducted as described in 

example 4 with the capture nucleotide sequences spotted at 
concentrations of 3000 nM. The bacterial FemA' ' sequences were 
serially diluted before the PCR and being incubated with the 
arrays . 

Example 7: Detection of 16 homologous F emA sequences on array 

The consensus primers/' and the ampl icons were 
the same as described in the example 4 but the capture probes 
were chosen for the identification of 15 Staphylococcus 

15 species. The experiment is c^ducted as in example 4. The 

/ 

capture probes contain a spacer fixed on the support by its 

5" end and of the following sequence 5' 

GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 3' followed by the 

following specific sequences for the various femA from the 
20 different Stphylococgus . 

S. aureus ATTTAA&ATATCACGCTCTTCGTTTAG 

/ 

S . epidermidis ATTAAGCACATTTCTTTCATTATTTAG 
S. haemolyti?Us ATTTAAAGTTTCACGTTCATTTTGTAA 
25 S. hominis /ATTTAATGTCTGACGTTCTGCATGAAG 

S. saprophytics ACTTAATACTTCGCGTTCAGCCTTTAA 

S. capitis ATTAAGAACATCTCTTTCATTATTAAG 

S. /fcaseolyticus ATAAAGACATTCGAGACGAAGGCT 
30 ^ cohnii ACTTAACACTTCACGCTCTGACTTGAG 

/S. gallinarum ACTTAAAACTTCACGTTCAGCAGTAAG 
/ S. intermedius GTGGAAATCTTGCTCTTCAGATTTCAG 
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y 
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S. lugdunensis TTCTAAAGTTTGTCGTTCATTCGTTAG 
S. schleiferi TTTAAAGTCTTGCGCTTCAGTGTTGAG 
S. sciuri GTTGTATTGTTCATGTTCTTTTTCTAA 
S. simulans TTCTAAATTCTTTTGTTCAGCGTTCAA 
S. warneri AGTTAAGGTTTCTTTTTCATTATTGAG 
S. xylosus GCTTAACACCTCACGTTGAGCTTGCAA 

Example 8; Detection of 19 homoqous v34 Sequences of 
Mycobacteria 



The P3 4 genes present in ,all Mycobacteria are 
\^ij^f 1 all amplified with the following consensus primers 



Sens 

15 MycU4 5' CATGCAGTGAATTAGAACGT 3 ' located at the position 496- 
515 of the gene, Tm = 56°C 



Antisens 

20 APmcon02 5' GTASGTGATRRSTYCTCC 3' located at the position 

position 733-750 pf the gene, Tm = 52-58°C 

/ 

S = C or G 
R = A or G 
Y = T or C 

25 The size of amplified products ranges from 123 

to 258 bp 

The following capture probes have been chosen 
fQf the specific capture of the Mycobacteria sequences. 

30 Capture probes 

Avium : 5' CGGTCGTCTCCGAAGCCCGCG 3» (21 nt) 

/ Gastrii 1 : 5' GATCGGCAGCGGTGCCGGGG 3' (20 nt) 

/ 
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Gastrii 3 
Gastrii 5 
Gastrii 7 
Gordonae 



5 ' 
5 ■ 
5 ' 
5 > 



Intracellulare 1 : 
Intracellulare 3 



Leprae : 
Kansasi 1 
Kansasi 3 

10 Kansasi 5 
Malmoense 
Marinum : 
Scrofulaceum 
Simiae : 

15 Szulgai : 

Tuberculosis : 
Xenopi : 



5 ' 
5 ' 
5 ' 
5 ' 
5 ' 
5 ' 

5 ' 
5 ' 



GTATCGCGGGCGGCAAGGT 3 ' ( 1 9 nt ) 
TCTGCCGATCGGCAGCGGTGCCGG 3' (2 4nt) 
GCCGGGGCCGGTATTCGCGGGCGG 3' (2 4nt) 
G ACGGG C AC T AGT TGT C AG AGG 3' (22 nt) 

5" GGGCCGCCGGGGGCCTCGCCG 3' (21 nt) 
: 5* GCCTCGCCGCCCAAGACAGTG 3' (21 nt) 
GATTTCGGCGTCCATCGGTGGT 3' (22 nt) 
GATCGTCGGCAGTGGTGACGG 3' (21 nt) 
TCGTCGGCAGTGGTGAC 3 ' ( 1 7 nt ) 
ATCCGCCGATCGTCGGCAGTGGTGACG 3' (27 nt) 
GACCCACAACACTGGTCGGCG 3' (21 nt) 
CGGAGGTGATGGCGCTGGTCG 3 ' ( 2 1 nt ) 

5' CGGCGGCACGGATCGGCGTC (2 0 nt) 
ATCGCTCCTGGTCGCGCCTA 3 1 (2 0 nt) 
CCCGGCGCGACCAGCAGA^G 3' (21 nt) 

5' GCCGTCCAGTCpT TAATGTCGC 3' (22 nt) 
CGGTAGAAGCTGCGATGACACG 3' (22 nt) 



Each of the sequences above comprises a spacer 

20 at its 5' end 

Spacer sequence 5 ' GAAT TCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 
3 ' 

Capture probes are aminated at their 5' end. 
25 Example 9; Detection of MAGE genes 

MAGE genes are all amplified with the following 
consensus primers 
Sens 

30 - DPSCONS2 5' GGGCTCCAGCAGCCAAGAAGAGGA 3', located at the 
398^421 position of the gene 
Tm = 78 °C 

Other ampl icons have been added as sense primer 
in order to increase the efficiency of the PCR for some MAGEs 
$5 - DPSMAGE1 5' GGGTTCCAGCAGCCGTGAAGAGGA 3 ' Tm = 78°C 
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- DPSMAG8 5 ! GGGTTCCAGCAGCAATGAAGAGGA 3' Tm = 74 °C / 

- DPSMAG12 5' GGGCTCCAGCAACGAAGAACAGGA 3' Tm = 76 °C 

Ant is ens e 

5 - DPASC0NB4 5 ■ CGGTACTCCAGGTAGTTTTCCTGC 3 ' , located at the 
position 913-936 of the gene, Tm = 74°C 
The size of the amplified products is around 53.0 bp 



h 



J 



The following capture probes of 27 nucleotides 
10 have been chosen for the specific cpature of the MAGE 
sequences . 

Capture probes 

/ 

Mage 1 DTAS01 5* AC AAGGAC T C C AGG/^T AC AAGAGGTGC 3' Mage 2 
-DTAS02 5' ACTCGGACTCCAGGTCGG^AAACATTC 3' 

X 15 Mage 3 DTS03 0 6 5' AAGACAGTAT^TGGGGGATCCCAAGA 3' 
Mage 4 DTAS04 5 ■ TCGGAACA&GGACTCTGCGTCAGGCGA 3 ■ 
Mage 5 DTAS0 5 5' GCTCGG^CACAGACTCTGGGTCAGGG 3' 
Mage 6 DTS06 5' C AAG^C AGG C T T C C T G AT AAT CAT C C T 3' 

Mage 7 DTAS0 7 5' AGJ2&CGCCAGGTGAGCGGGGTGTGTCT 3' 

20 Mage 8 DTAS08 5' QGGACTCCAGGTGAGCTGGGTCCGGGG 3' 

/ 

Mage 9 DTAS0 9 5' / TGAACTCCAGCTGAGCTGGGTCGACCG 3' 
Mage 10 DTAS10 5,/' TGGGTAAAGACTCACTGTCTGGCAGGA 3' 
Mage 11 DTAS115' GAAAAGGACTCAGGGTCTATCAGGTCA 3' 
Mage 12 DTAS12 5' GTGCTACTTGGAAGCTCGTCTCCAGGT 3' 

25 

Each of the sequences above comprises a spacer 
aminated at its 5' end in order to be covalently linked to 
the glass 

Spacer sequence 
30 5 ' GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 3 ' 

They are spotted on aldehyde bearing glasses 
and used for the detection of the MAGEs amplified by the 
consensus primers given here above. The results show a non 
equivocal identification of the MAGEs present in the tumors 
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compared to identification using 12 specific PCR, one foy / 
each MAGE sequences . / 

/ 

Example 10: Identification of G-protein dopamin receptors 
5 subtypes in rat 

Dopamine Receptor coupled to the G-protein are 
all amplified with the following consensus primers 

Sens 

10 - C0NSENSUS2 -3-4 

5" TG C AG ACMAC C AC C AAC T AC T T 3' located at the position 221-242 

of the gene, Tm = 6 6 °C 
M = A or C 

15 - CONSENSUS1-5 

5' TGMGGKCCAAGATGACCAACWT , i ' (22 nt ) located at the position 

221-240 of the gene, Tm = 66°C 
M = A or C 
K = G or T 
20 W = A or T 

An ti sens 

5 TCATGRCRQASAGGTTCAGGAT 3' located at the position 395-416 

of the gene, Tm = 64-68°C 
25 R = A or G 

S = C or G 

The size of the amplified product is 196 pb. 

The following capture probes of 27 nucleotides 
30 have been chosen for the specific capture of the dopamine 
/ receptor sequences. 
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Capture probes 

DRD1 5 1 CTGGCTTTTGGCCTTTGGGTCCCTTTT 3 ' DRD2 5 ' 

TGATTGGAAATTCAGCAGGATTCACTG 3 ' DRD3 5 ' 

GAGTCTGGAATTTCAGCCGCATTTGCT 3 ' DRD4 5 ■ 

5 CGTCTGGCTGCTGAGCCCCCGCCTCTG 3 ' DRD5 5 ' 
CTGGGTACTGGCCCTTTGGGACATTCT 3 ' 

Each of the sequences above comprises an 

aminated spacer at its 5' end. Spacer sequence 5' 
1 0 GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 

Example 11; Identification of G-protein histamin receptors 



J^S "\ subtypes in rat 



Histamin Receptor coupled to the G-protein are 
15 all amplified with the following primers 

Sens 

- Hlsens 

5' CTCCGTCCAGCAACCCCT 3* (18 nt) located at the Position 381- 
20 398 of the gene, Tm ^- 60°C 

- H2sens 

5* CTGTGCTGGTCACCCCAGT 3' (18 nt) located at the Position 
380-398 of the gene, Tm = 62°C 

25 

- H3sens / 

5' ACTQATCAGCTATGACCGATT 3 ' (21 nt) located at the Position 
378-398 of the gene, Tm = 60°C 

30 Antisens 

- Hlant isens 

5' ACCTTCCTTGGTATCGTCTG 3 ' (20 nt) located at the Position 
722-741 of the gene, Tm = 60°C 
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- H2antisens 

5' G AAA C C AG C AG AT G ATG AA C G 3' (21 nt) located at the Positio^ 
722-742 of the gene, Tm = 62 °C 

5 - H3antisens 

5' GCATCTGGTGGGGGTTCTG 3' (19 nt) located at the Position 722- 
740 of the gene, Tm = 62°C 

Size of the amplified product ranges from 359 

10 to 364 pb. 

The following capture probes have been chosen 
for the specific capture of the histamin receptor sequences. 

Capture probes 

HI 5' CCCCAGGATGGTAGCGGA 3' (18 nt) 
15 H2 5 ' AGGATAGGGTGATAGAAATAAC 3' (2? nt) 
H3 5' TCTCGTGTCCCCCTGCTG 3' (18 /it) 

Each of the sequences above comprises a spacer 

at its 5' end 

20 Spacer sequence 5 ' 

GAATTCAAAGTTGCTGAGAATA'GTTCAATGGAAGGAAGCG 3 ' . Capture probes 
. are aminated at their 5' end. 



25 Example 12 : Identification of G-protein serotonin receptors 

subtypes in rat 

Serotonin Receptor coupled to the G-protein are 
all amplified with the following primers 
Sens 

30 - Consensus for the subtypes 1A, IB, 1C, ID, IE, 2A, 2B, 2C, 
4/6, 7 

4 ' ATCHTGCACCTSTGBGBCAT 3' Tm = 58-64°C (20 nt) 

/ H = C or A or T 
/ S = C or G 
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Mi-.. ' 

AW 



10 



15 



20 



25 





• 






• 






B = 


= C or T or G 












1A 


ATCCTGCACCTGTGCGCCAT 


(0 


mismatch) 


position 


370 


-389 


IB 


ATCATGCATCTCTGTGTCAT 


(1 


mismatch) 


position 


397 


-416 


1C 


ATCATGCACCTCTGCGCCAT 


(0 


mismatch) 


position 


427 


-446 


ID 


ATCCTGCATCTCTGTGTCAT 


(1 


mismatch) 


position 


3^7 


-386 


IE 


ATCTTGCACCTGTCGGCTAT 


(2 


mismatch) 


position 


331 


-350 


2A 


ATCATGCACCTCTGCGCCAT 


(0 


mismatch) 


position 


487 


-506 


2B 


ATCATGCATCTCTGTGCCAT 


(1 


mismatch) 


position 


424 


-443 


2C 


ATCATGCACCTCTGCGCCAT 


(0 


mismatch) 


position 


24- 


43 


4 


ATTTTTCACCTCTGCTGCAT 


(3 


mismatch^) 








6 


ATCCTCAACCTCTGCTTCAT 


(3 


mismatvChs) 








7 


ATCATGACCCTGTGCGTGAT 


(3 


misruatchs) 









- Consensus 4, 6 

5' ATCYTYCACCTCTGCYKCAT .,3 ■ Tm = 52-64°C (20 nt) 

K = G or T 

Y = T or C 

4 ATTTTTCACCTCTGCTGCAT (1 mismatch) position 322-341 
6 ATCCTCAACCTCTGCCTCAT (1 mismatch) position 340-359 

- Consensus SA, 5B 

5' ATCTGGAAYGTGRCAGCCAT 3 ' Tm = 58-62°C (20 nt) 

Y = T or C 
R = A or G 

5 A ATCTGGAATGTGACAGCAAT (1 mismatch) position 385-404 
5B ATCTGGAACGTGGCGGCCAT (1 mismatch) position 424-443 
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- Specifique 7 

5 ' ATCATGACCCTGTGCGTGAT 3 



Tm 



56°C (18 nt) position 517-536 



- Specifique 3B 

5' CTTCCGGAACGATTAGAAA 3 * Tm = 54°C (19 nt ) position 404-422 
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Antisens 

- Consensus for the subtypes 1A, IB, 1C, ID, IE, 2A, 2B, 2C, 





4, 


7 Tm = 48-58 °C 












5 ' 


TTGGHNGCYTTCYGBTC 


3 ' 








5 


H = 


= A or T or C 












N = 


= A or C or G or T 












B = 


= C or T or G 












1A 


TTCACCGTCTTCCTTTC 


(4 


mismatchs) 








IB 


TTGGTGGCTTTGCGCTC 


(1 


mismatch) 


position 913- 


929 


10 


1C 


TTGGAAGCTTTCTTTTC 


(1 


mismatch) 


position 922- 


938 




ID 


TTAGTGGCTTTCCTTTC 


(2 


mismatchs) 


position 877 


-893 




IE 


GTGGCTGCTTTGCGTTC 


(2 


mismatchs) 


position 862 


-878 




2A 


TTGCACGCCTTTTGCTC 


(2 


mismatchs) 


positioi^ 952 


-968 




2B 


TTTGAGGCTCTCTGTTC 


(2 


mismatchs) 


position 952 


-968 


15 


2C 


TTGGAAGCTTTCTTTTC 


(1 


mismatch) 


position 424- 


440 




4 


TTGGCTGCTTTCCGGTC 


(2 


mismatchs) 








7 


GTGGCTGCTTTCTGTTC 


(1 


mismatch) 


position 973- 


989 



- Specific 1A 

20 5' TTCACCGTCTTCCTTTC 3' Tm =/50°C (17 nt) position 1018-1034 

- Specific 4 

5 , TCTTGGCXGCTTXGGTC 31 Tm = 52°C (17 nt) position 762-778 

25 - Specific 6 

5' ATAAAGAGCGGGTAGATG 3' Tm = 52 °C (18 nt) position 945-963 

- Consensus 5A, 5B 

5' CCTTCTGCTCCCTCCA 3 ' Tm = 52°C (16 nt) 

30 

5A CCfTTCTGTTCCCTCCA (1 mismatch) position 823-840 

5B CCTTCTGCTCCCGCCA (1 mismatch) position 862-879 

- Specific 3B 

^5 5' ACCGGGGACTCTGTGT 3 ■ Tm = 52°C (16 nt ) position 1072-1089 
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The following capture probes have been chosen 
for the specific capture of the serotonin receptor subtypes 
sequences . 



5 


Capture probes 






HTR1C 


5 ' 


C T ATG C T C AAT AGG AT T AC GT 


3' (21 nt) 




HTR2A 


5 ' 


GTGGTGAATGGGGTTCTGG 3 1 


(19 nt) 




HTR2B 


5 ' 


TGGCCTGAATTGGCTTTTTGA 


3' (21 nt) 




HTR2C/1C 


5 ' TTATTCACGAACACTTTGCTTT 3 ' (22 


J. u 


HTR1B 


5 ' 


AATAGTCCACCGCATCAGTG I 


I ' (20 nt) 




HTR1D 


5 ' 


GTACTCCAGGGCATCGGTG 3 ' 


(19 nt) 




HTR1A 


5 ' 


CATAGTCTATAGGGTCGGTG 3 


1 (20 nt) 




HTR1E 


5 ' 


ATACTCGACTGCGTCTGTGA 3 


' (2 0 nt) 




HTR7 


5 ' 


GTACGTGAGGGGTCTCGTG 3 ' 


(19 nt) 


15 


HTR5A 


5 ' 


GGCGCGTTATTGACCAGTA 3 1 


(19 nt) 




HTR5B 


5 ' 


GGCGCGTGATAGTCCAGT 3 ' 


(18 nt) 




HTR3B 


5 ' 


GATATCAAAGGGGAAAGCGTA 


3' (21 nt) 




HTR4 


5 ' 


AAACCAAAGGTTGACAGCAG 3 


■ (20 nt) 




HTR6 


5 ' 


GTAGCGCAGCGGCGAGAG 3 ' 


(18 nt) 


20 











Each of the sequences above comprises a spacer 

at its 5 ' end 

Spacer" sequence 5 1 

GAATTCAAAGTTGCTGAQAATAGTTCAATGGAAGGAAGCG 3 ' . Capture probes 
25 are aminated at their 5' end. 

Example 13 : Identification of the HLA-A subtypes 

The HLA-A subtypes are amplified with the 
following consensus primers 
30 Sens 

IPSCOItfA 5' GACAGCGACGCCGCGAGCCA 3' located at the position 
181-200 of the gene, Tm = 70°C 
Ant i sens 

LPASCONA 5 CGTGTCCTGGGTCTGGTCCTCC 3' located at the position 
35^/735-754 of the gene, Tm = 74°C 
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The size of the amplified product is 574 bp 

The following capture probes of 27 nucleotides 
have been chosen for the specific capture of the HLA-A 
sequences 

5 ' GGAGGGCCGGTGCGTGGACGGGCTCCG 3 ' 
5 1 TCTCCCCGTCCCAATACTCCGGACCCT 3 ' 
5 1 CTGGGCCTTCACATTCCGTGTCTCCTG 3 ' 
5 ' AGCGCAAGTGGGAGGCGGCCCATGAGG 3 ' 
5 ■ GCCCATGCGGCGGAGCAGCAGAGAGCC 3 ■ 
5 ' CCTGGAGGGCCGGTGCGTGGAGTGGCT 3 ' 
5 ' GCCCGTGTGGCGGAGCAGTTGAGAGCC 3 ' 
5 ■ CCTTCACTTTCCCTGTCTCCTCGTCCC 3 ■ 
5 ■ GCCCATGTGGCGGAGCAGCAGAGAGCC 3 1 
5 ' TAGCGGAGCGCGATCCGCAGGTTCTCT 3 ' 
5 ■ TAGCGGAGCGCGATCCGCAGGCTCTCT 3 ' 
5 1 TCACATTCCGTGTGTTCCGGTCCCAAT 3 * 
5 ■ GGGT'CCCCAGGTTCGCTCGGTCAGTCT 3 1 
5 ' TGACATTCCGTGTCTGCAGGTCCCAAT 3 ' 
5 1 CGTAGGCGTGCTGTTCATACCCGCGGA 3 ' 

5 ' CCCAATACTCAGGCCTCTCCTGCTCTA 
5 ■ CGCACGGACCCCCCCAGGACGCATATG 3 • 
5_[ GGCGGCCCATGTGGCGGAGCAGTGGAG 31 

ITASA68B /5 1 GTCGTAGGCGTCCTGCCGGTACCCGCG 3' 

25 HLA-A69 ITASA69 5' ATCCTCTGGACGGTGTGAGAACCGGCC 3' 

Each of the sequences above comprises an aminated spacer at 
its 5' end. Spacer sequence 5' 

GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 3 1 

30 Ekample 14; Identification of Cytochrome P450 3a forms 

The Cytochrome P4 5 0 forms are amplified with the 
f ollowingconsensus primers 



5 Capture probes 

HLA-A1 ITSA01 
HLA-A2 ITASA02 
HLA-A3 ITASA03A 
ITSA03B 
10 HLA-A11 ITSA11A 
ITSA11B 
HLA-A23 ITSA23A 

ITASA2 3B 
HLA-A2 4 ITSA2 4A 
"v. : 15 ITASA2 4B 

£\ 3 HLA-A2 5 ITASA2 5 A 

>-V^' ITASA2 5B 

HLA-A2 6 ITASA2 6 
^ HLA-A2 9 ITASA2 9 

20 HLA-A3 0 ITASA3 0 
HLA-A31 ITASA31 
3 ' HLA-A3 3 ITSA3 3 
HLA-A68 ITSA6 8A 
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10 



Sens 

- Consensus 

5' GCCAGAGCCTGAGGA 3' located at the position 1297-1311 of 
the 3a3 gene, Tm = 50°C 

Antisens 

- Consensus a3 , a23, al, a2 

5' T C AAAAG AAAT T AAC AG AG A 3' located at the position 1839-1858 
of the 3a3 gene, Tm = 50°C 

- Specific a9 

5" ACAATGAAGGTAACATAGG 3' located at the position 2015-2033 
of the 3a9 gene Tm = 52 °C 

15 - Specific al8 

^ A 5' ACTGATGGAACTAACTGG 3' located at the position 1830-1846 of 

• N 1$ v the 3al8 gene Tm = 52°C 

^ i ^ The length of the PCR product is around 560pb. 

v 

s 

20 The follbwing capture probes have been chosen 

for the specific capture of the cytochrome P-450 3a 
'Q- sequences . 

Capture probe 

25 3al 5' TGTTTTGATTCGGTACATCTTTG 3' (24 nt ) 
3a3 5' TTGATTTGGTACATCTTTGCT 3' (21 nt) 
3A9 .-5' ACTCCTGGGGGTTTTGGGTG 3' (20 nt) 
3A18 5' ATTACTGAGTATTCAGAAATTCAC 3' (24 nt) 
3A2 5' GGTTAAAGATTTGGTACATTTATGG 3' (25 nt) 
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Each of the sequences above comprises a spacer 

at its 5 ' end 
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Spacer sequence 5 1 

GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 3 ' . Capture probes 
are aminated at their 5' end. 



5 Example 15 ; Identification of GMQ on biochips 

Consensus primers to detect GMO on biochips; 
0GM1 CGTCTTCAAAGCAAGTGGATTG 
0GM2 ATCCTGTTGCCGGTCTTGCG 

These primers allow the amplification of the 

\ \ ") 10 genes : 

"V v * 1) CTP1, CTP2, CP4EPSPS, S CrylAb and hsp 70 Int. in Mon 809 
(corn, Monsanto) 

2) hsp 70 Int. and S CrylAb in Mon 810 (corn, Monsanto) 

3) S CrylAb and ,S Pat in Bt 11 (corn, Novartis) 

15 4) CTP4 and EPSPS in GTS40-3-2 (soybean, Monsanto) 

The capture probes will be chosen in these 
sequences to allow discrimination. Each of the sequences 
above comprises a spacer at its 5 T end 

Spacer sequence 5' GAATTCAAAGTTGCTGAGAATAGTTCAATGGAAGGAAGCG 
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